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We propose a network model with a fixed number of nodes and links with a dynamics which favors 
links between nodes differing in connectivity. Parameter regimes where the degree distributions 
follow power-laws, P(fc) ~ , high clustering following C{N) ~ 1/A'' and small- world properties, 
with a network diameter following D{N) A + B log A^, are observed. Our model gives results 
comparable with real-world protein networks. 
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I. INTRODUCTION 

Over the last few years a large number of network mod- 
els have been put forward, highly motivated by empirical 
studies of real-world networks. 

The various models can be categorized belonging to 
one of three main classes of modeling paradigms. First, 
different variants of the random graph model of Erdos 
and Renyi 0| are still used for comparison with many 
different models and empirical studies The second 
group of network models are refered to as small-world 
models, first presented by Watts and Strogatz Q and 
are motivated by high clustering observed in many real- 
world networks. This group of network models aims to 
include both the idea of highly clustered networks and 
random graphs. Third, the construction of various scale- 
free models have been motivated by the discovering of 
power-law degree distributions in real-world networks, 
ranging from the World Wide Web to the network 
of Science collaboration 5] and the web of human sex- 
ual contacts (QJ . This group of network models focuses on 
the dynamics of the network and aims to offer a universal 
theory of network evolution 0. 

In the past few years, a wide range of concepts and 
measures for complex networks have been proposed and 
investigated. However, complex networks are most often 
described by three basic concepts. 

The small-world concept describes the fact that there 
is a relative short path between any two nodes in most 
networks. The maximum of the shortest paths between 
any two nodes, refered to as the diameter, is often ob- 
served to grow logarithmically with the network size, N. 
This property is not related to a particular organizing 
principle 2], and are observed in random graphs, small- 
world model networks and scale-free networks. 

The clustering of a network is related to the formation 
of cliques of nodes being linked to each other. The clus- 
tering in most real networks is observed to be larger than 
the clustering in random graphs. Many proposed models 
of complex networks grasp this idea. 
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The third main characteristic for complex networks is 
the degree distribution. The degree distribution, P{k), 
gives the probability for a randomly selected node to be 
connected to k different other nodes. For a wide range of 
complex networks a power-law distribution P{k) ~ 
has been observed 0, S H 0,11 H Ell . This deviates sig- 
nificantly from random graphs where links are placed ran- 
domly and from small- world models. In random graph 
models and in small-world models a large number of 
nodes have a degree close to the average degree of the 
network, k. 

Over the last few years a wide range of protein net- 
works have been studied |0, 0, ll.lf . These networks 
are formed by direct physical interaction between pairs 
of proteins and they form the underlying structure for 
the propagation of various signals regulating the proteins 
[T3| . The motivation for the present study is the recent 
observation that protein networks guiding the biochem- 
istry of living cells, the placement of links tends to occur 
between high and low connectivity nodes rather than be- 
tween nodes of similar connectivity ^ll- The fact that 
one observe that the highly connected proteins are mostly 
connected to those with low connectivity, meaning that 
the highly connected nodes are well separated, is believed 
to increase the robustness of the networks . This ob- 
servation is somewhat reminiscent of the networks that 
connect people with respect to their professional special- 
ities. An example might be a network describing medi- 
cal relations in terms of physician-patients relations. A 
physician (a highly connected node) has many patients 
(low connected node), but does not have a physician- 
patient realtion to many other physicians. Patients do 
not have physician-patient relations with each other. The 
same argument goes for many networks describing rela- 
tions based on a certain profession or some sort of spe- 
cialization. 

In the next section, we describe how we construct 
networks with the property that contrasting nodes are 
preferably attached through links by using a Monte Carlo 
technique. We then go on to show the results of our 
model simulations, focusing on the three main charac- 
teristics of complex networks, degree distributions, clus- 
tering and small- world properties. Finally we construct 
randomized versions of our networks and calculate the 
ratio between the degree distributions from the original 
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network and the randomized version of the same network. 
These results are compared with resuhs from real-world 
protein networks. 



II. DESCRIPTION OF THE MODEL 

Rather than attempting to construct a network that 
preferably connects highly contrasting nodes through re- 
constructing the process that may naturally have devel- 
oped them, we take the Monte Carlo approach. Even 
though this is well-known and well understood technique, 
it is useful to remind the reader of its philosophy as 
we proceed. Given a probability distribution p{i,j) for 
having a link between nodes i and j, the Monte Carlo 
method constructs a biased random walk through the 
set of different network configuration such that the rel- 
ative frequency of encountering configurations with such 
a link present is proportional to p{i,j). This probability 
should not set any intrinsic scale for the network. The 
natural candidate for such a probability should then be 
a power law in the ratio ki/kj. Hence, we propose the 
simplest form that accentuates the contrast between the 
connectivity of the two nodes i and j without bringing 
any intrinsic scales into the problem. 
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This may be rewritten in a more compact form as follows, 
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The network we consider consists of TV nodes with L 
undirected links between them. We term a given config- 
uration of links between the nodes as Q. The probability 
to find a given configuration Q is then 
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We see that formally, the probability for finding a given 
configuration Q follows the Boltzmann distribution with 
a Hamiltonian defined in Eq. ||SJ). The parameter (3 has 
the formal appearence of an inverse temperature, but it 
should not be interpreted as anything more than the sin- 
gle remaining parameter in the probability when scale- 
freeness is implemented. 



We implement the Monte Carlo procedure using the 
Metropolis algorithm [T^ [T5| . a well-known al gori thm 
also previously used in different network models [l^ . In 
order to construct the random walk, we need the tran- 
sitional probabilities VlQ G') which have to obey de- 
tailed balance, V[g)V{Q G') = V{G')V{G' ^ G). The 
Metropolis prescription consists in first defining a set of 
neighborhood configurations. These are in our case sim- 
ply all configurations that can be reached from G mov- 
ing one link without placing two links between the same 
pair of nodes. The number of such neighboring states is 
C= [N[N-l)/2]\/{[N{N-l)/2-L\\L\). Next we define 
the partial transitional probability 'n{G G') — l/C. If 
we now have that 



T^iG G') = Tr{G' 
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the Metropolis construction of V{G G') from ti{G 
G') ensures that detailed balance is fulfilled. By construc- 
tion, relation © is fulfilled. The transitional probability 
is now given by 
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We emphasize at this point that it is essential that de- 
tailed balance to be fulfilled if the Monte Carlo algorithm 
is to produce configurations G with probability propor- 
tional with the prescribed V{G) — and that this is fully 
ensured once Eq. ((HJ is fulfilled. 



III. RESULTS 

In figure n we show examples of networks with iV = 50 
nodes and average connectivity fc = 4 for different val- 
ues of the parameter /3 when the networks have reached 
equilibrium after ~ iV^ iterations. Our model allows so- 
called single nodes, i.e., nodes that are not connected to 
any other nodes. It is also possible to split the network 
in disjoint components. 

In figure 121 we show the probability, P{k), for a node 
being linked to k different other nodes for different values 
of the parameter (3. For small values of /3, the network 
behaves essentially as a random network. However, as /3 
is increased, the importance of the contrast between the 
connectivity of each pair of nodes is increasingly accentu- 
ated. The effect of this is seen clearly in the diagrams for 
(3 < 0.8, where a power law appears. This is the regime 
where the model produces networks with the connectiv- 
ity properties described in the Introduction. Further- 
more, these networks are scale free, as power law in P{k) 
indicates. 
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FIG. 1: A network with N = 50 nodes with k — 4 for different inverse temperature /3 when the networks have reached 
equihbrium after ~ A'"'^ iterations. 



There is a phase transition in the model associ- 
ated with a. (3 = (3c- In order to investigate this, 
we study moments of the nodal distribution Piq^pik), 
k'^PM^pik) {n > 2), for different network sizes N . As 
N grows, Ylk k"PN,i3{k) [n > 2) plotted as a function of 
f3, converges towards a stepfunction with the step at crit- 
ical (3c- By looking at the slope of the step and plot the 
slope intersection with the /3-axis versus 1/A^ and finally 
extrapolating l/N —^ 0, the numerical value of ()c may 
be determined. The result of this analysis is shown in 
figure|3| and we find (3c — 0.60. 
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FIG. 2: The probability P{k) for a node to be linked to k 
different other nodes for different /3. These nodal distributions 
are averages for networks with N = 2000 nodes with fc = 4. 

Figure 21 shows the cluster coefficient for different val- 
ues of the parameter /? as a function of the network size, 
N. If we consider a single node i with connectivity fc^, 
which means ki neighbors, we calculate the cluster co- 
efficient Ci as i Ci = rrii /M^ . , where . is the high- 
est possible number of links i between fc^'s neighbors. 
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while rrii is the actual number of links 



FIG. 3: Slope intersection with the /3-axis for the stepfunction 
fc"Pjv,/3(fc) (n > 2) as function of inverse network size, 
Extrapolation gives a critical (3c = 0.60. 



between ki^s neighbors. The mean cluster coefficient for 
a given temperature and a given network size, C{(3,N) 
is the average of all these c^'s. For all values of /?, we 
have a decreasing cluster coefficient as a function of the 
network size N, C{(3,N) ~ N~'^ with uj close to 1 for 
all (3. We observe that the largest clustering is found for 
intermediate values of /?, close to (3c 

It is also possible to look at the average cluster coeffi- 
cient for a node with connectivity ki. These results are 
shown in figure [S] for different values of (3. For high /?, 
the model seems to show a power-law dependence for the 
clustering as a function of the degree, fc, c{ki) ~ fc"''', 
with 7 close to 3. This means that nodes with low con- 
nectivity are typically better clustered than nodes with 
high connectivity. For (3 = 0.0, we have an exponent 
7 = and this seems to be close to the situation for any 
(3 below the critical value. 

In many real-world networks one observes that there 
is a relative short path between any two nodes in the 
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FIG. 4: Mean cluster coefficient, C{N,P) as a function of 
network size for different f3 values. • : (3 — 0.0, * : f3 = OA, 
o : /3 = 0.8 and □ : /3 = 1.2. 



FIG. 6: Mean diameter, D{N), as a function of the network 
size, iV, for different (3. • : /3 = 0.0, ★ : /3 = 0.4, o : /3 = 0.8 
and □ : /3 = 1.2. 



FIG. 5: Cluster coefficient as a function of the degree, k, for 
different /3 values. 



network. The maximum of the shortest paths between 
any two nodes in a network is most often refered to as 
the diameter of the network. In figure|^we plot the mean 
diameter D{N) as a function of the network size, N, for 
different /3. We observe small-world properties with the 
diameter growing logarithmically with the network size, 
iV. Typically, we also see a growing diameter for lower 

A good illustration of connectivity correlations is to 
compare the network to a randomized network where the 
nodes have the same connectivity as in the original net- 
work but with randomized links. If P(fco, fci) denotes the 
probability for a node with connectivity fco to be linked 
to a node with connectivity ki and Pr{kQ,ki) denotes 



the same probability in the randomized network, then 
the ratio P{ko, ki) / Pr{ko, ki) is an interesting measure 
for the connectivity correlations which can extract some 
characteristics of the network. Figure [7| shows this ra- 
tio for two values of (3, on either side of the critical (3c, 
(3 = 0.4 and (3 — 0.8. Because our network consists of 
undirected links, we have a symmetry around fco = fci. 
We see regions in the ko — fci-plane where connections 
between nodes with certain connectivities either are sig- 
nificantly enhanced or suppressed compared to random- 
ized networks. The darker blue area around k^ = ki, 
reflects the tendency that it is less likely for two nodes 
with connectivities not differing much, to be connected. 
Along the fcg- or fci-axis and close to these axis, we see 
a increased probability that nodes differing significantly 
in connectivity are connected. The same ratio for real- 
world protein networks show richer patterns |13| , but it is 
possible to recognize some tendencies comparing it to our 
results. In areas around ko = ki we observe a reduced 
probability for two nodes with equal or close to equal 
connectivity to be connected, and we see an increased 
probability for nodes differing much in connectivity to 
be connected, observed close to the kg- and /ci-axis 



IV. SUMMARY AND CONCLUSION 

In this paper we have presented a network model with 
a static number of nodes and a static number of undi- 
rected links. Our model favors links between nodes dif- 
fering in connectivity. We observe a series of character- 
istics observed in real- world networks. For small values 
of (3, the network behaves essencially as a random net- 
work. As P increases, the importance of the contrast 
between the connectivity of each pair of nodes increases, 
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FIG. 7: Connectivity correlations. P(A;o, fci)/Pr(fco, ^i) where 
P{ko, ki) is the probability for two nodes with connectivities 
fco and fci are connected, while Pr{ko,ki) is the same proba- 
bility in a randomized version of the same networks. In the 
randomized network, the nodes have the same connectivity as 
in the original network, but the links have been randomized. 



and we observe scale free degree distributions indicated 
by power laws. We also observe a phase transition at 
(3 — Pc = 0.60. Our model gives networks with relative 
high clustering and a cluster coefficient decreasing with 
increasing network sizes as C{N) ~ A^~^. We observe 
the largest clustering for intermediate values of /3 close 
to f3c- The small- world property in our network model 
is indicated by a diameter growing logarithmically with 
the network size, D{N) ^ A + B\ogN. The diameter de- 
creases for increasing values of /3. Finally we constructed 
randomized versions of our networks in order to compare 
our results with real-world protein networks. Patterns 
in the connectivity correlation plot, P{ko, ki) / Prik^, ki), 
for /?- values on either side of the critical value for /3, 
show similarities with patterns from real-world protein 
network. Connections between nodes with connectivities 
differing significantly are enhanced while connections be- 
tween nodes with equal or almost equal connectivities are 
suppressed compared to randomized versions of the same 
networks. 

We would like to thank Kim Sneppen at NORDITA 
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